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We have determined the nucleotide sequence of the 
Escherichia coli fepA gene, which codes for the outer 
membrane receptor for ferrienterochelin and colicins 
B and D. The predicted FepA polypeptide has a molec- 
ular weight of 79,908 and consists of 723 amino acids. 
A 22 -amino acid leader or signal peptide preceded the 
mature protein. With respect to overall composition, 
hydropathy, net charge and distribution of nonpolar 
segments, the FepA polypeptide was typical of other E. 
coli outer membrane proteins, except that FepA con- 
tained 2 cysteine residues. Comparison of the deduced 
amino acid sequence of FepA with that of three other 
TonB -dependent receptors (BtuB, FhuA, and IutA) re- 
vealed only a few regions of sequence homology; one 
of these included the amino- termini. An amino acid 
substitution within the conserved amino-terminal re- 
gion of BtuB resulted in production of a receptor that 
had normal binding functions but was incapable of 
energy-dependent transport of vitamin Bi 2 . This result 
suggests that the amino-terminal end of these four 
polypeptides is involved in interaction with the TonB 
protein or another step of energy transduction. Three 
other regions of homology were shared among the four 
proteins: one near residues 50 to 70, another at about 
residue 100 to 140, and the last between 20 and 40 
amino acid residues from the carboxyl terminus. The 
function of these three regions remains speculative. 



At physiological pH iron is sequestered as insoluble com- 
plexes and therefore is not readily available to cells. Many 
organisms, including Escherichia coli, extract iron from the 
environment by synthesizing and excreting iron chelators" 
known as siderophores (for reviews see Refs. 1 and 2). Esch- 
erichia coli normally produces the siderophore known as en- 
terochelin or enterobactin. Enterochelin is a cyclic trimer of 
2,3-dihydroxy-iV-benzoyl-L-serine and its synthesis is regu- 
lated by the intracellular iron supply. 

In addition to making enterochelin, iron-starved cells pro- 
duce a number of proteins some of which are directly involved 
in the transport of iron chelates (2). FepA is an 81,000-dalton 
E. coli outer membrane protein that functions in the initial 
step of iron uptake by binding ferrienterochelin (3). The 
subsequent transport of ferrienterochelin across the outer 
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membrane, presumably into the peripiasmic space, requires 
TonB function, which seems to provide energy for a number 
of outer membrane receptor-dependent processes. The spe- 
cific steps of iron transport from the periplasm into the 
cytoplasm are unclear but involve the products of the fepB 
and fepC genes 1 (4), probably in uptake across the cytoplasmic 
membrane, and the protein product of fes (ferrienterochelin 
esterase) which cleaves ferrienterochelin to allow release of 
its iron (5). Another outer membrane protein produced by 
wild-type E. coli under iron stress is FhuA, a 76,000-dalton 
protein that serves as the receptor for the hydroxamate sid- 
erophore ferrichrome, colicin M, and phages T5 and 080 (6). 
Although desferri-ferrichrome is produced by the fungus Us- 
tilogo sphaerogena, it can be used by E. coli as an iron carrier. 
Strains of E. coli harboring the ColV plasmid excrete the 
siderophore aerobactin and produce an outer membrane re- 
ceptor, IutA, responsible for its binding and transport (7, 8). 
As in the enterochelin system, both FhuA and IutA require 
TonB function for the energy-dependent step of shuttling 
iron across the outer membrane; both of these systems require 
different genes (fhuBCD) for the subsequent steps of uptake 
(9). 

Transport of vitamin B 12 (cyanocobalamin) is analogous to 
the uptake of iron in that a TonB-dependent outer membrane 
receptor and additional inner membrane proteins are required 
(2, 10). The vitamin B i2 outer membrane receptor, called 
BtuB, also serves as the receptor for the lethal agents, bacte- 
riophage BF23 and the E colicins (11). Whereas transport of 
vitamin B i2 requires TonB function, the binding to BtuB by 
the vitamin, as well as entry of the lethaLagents, does not. 
The suggestion by Heller and Kadner (12) that the amino- 
terminal region of BtuB interacts with TonB provided the 
impetus for this study since the domains of these receptors 
that interact with TonB might have homologous amino acid 
sequences. 

EXPERIMENTAL PROCEDURES 

Plasmids, Bacterial and Phage Strains— Bacteriophages Ml3mp9, 
mp8, mpl9, mpl8 (13), tgl30, and tgl31 (14) and their host JM101 
were from laboratory stock. Plasmid pPCl04, described in Fig. 1, is a 
derivative of pBR322 carrying an 8-kilobase SalhEcoRl fragment 
able to complement entD, fepA, and fes mutations (15). It and the 
fepA deletion strain, UT6900, were kindly provided by Charles Ear- 
hart (University of Texas at Austin). 

Genetic Techniques— Plasmid DNA for subcloning fepA was ob- 
tained from chloramphenicol-amplified cultures by the method of 
Katz et al (16) and purified by CsCl-ethidium bromide equilibrium 
centrifugation. M13 replicative form and template DNA was obtained 
as recommended by the manufacturer of the cloning and sequencing 
kits (Amersham Corp.). Enzymatic digestions and ligations of DNA 
were according to the manufacturer's (Bethesda Research Laborato- 

1 C. F. Earhart, personal communication. 
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Fig. 1. Sequencing strategy and 
restriction map of the fepA gene. 
The plasmids from which DNA frag- 
ments were obtained are shown at top. 
Open bars represent vector DNA. The 
direction and extent of a sequence read- 
ing is shown by the small arrows under 
the restriction map. Arrows starting with 
an X indicate fragments obtained by 
exonuclease III digestion, whereas ar- 
rows which begin with a bar represent 
fragments obtained by restriction endo- 
nuclease cleavage. The large arrow at the 
bottom designates the FepA coding re- 
gion and the direction, although not the 
extent, of transcription. 




ries or New England Biolabs) recommendations. Transformation or 
transfection of bacterial cells with DNA was done by heat shocking 
a mixture of DNA and cells that had been made competent by 
suspension in 0.1 M calcium chloride (17). 

Nucleotide sequence determination was performed by the Sanger 
dideoxy chain-terminating technique (18) from M13 templates using 
deoxyadenosine 5 X[«- M S]thio) triphosphate as described in the 
Amersham sequencing kit. The computer program described by 
Staden (19) was used to analyze sequence data. 

Exonuclease III digestion was used to generate fepA deletions by a 
method similar to that reported by Henikoff (20). The procedure 
involved the initial subcloning of fepA as a 5.4-kilobase Bglll-BamHl 
fragment into pUC8, Subclones with fepA in opposite orientations 
were digested with Pstl and Sail and then with exonuclease III. Since 
£coIII initiates degradation of unpaired four-base 3' ends ineffi- 
ciently, digestion occurs mainly on the 5' overhangs (in this case 
toward fepA). Blunt ends were made by treatment with Si nuclease. 
The DNA was fractionated by electrophoresis on a 0.8% agarose gel 
and various molecular weight fractions were isolated. The ethanol- 
precipitated DNA was ligated with T4 ligase and transformed into 
UT6900. 

RESULTS AND DISCUSSION 

Coderre and Earhart (15) have shown that plasmid pPCl04 
carries the promotor and coding region for FepA. The Bglll- 
BamHl fragment of pPCl04 (Fig. 1) was subcloned into the 
Bam HI site of pUC8 in both orientations. Deletion derivatives 
of these two plasmids were generated via exonuclease III 
digestion. When the deletion plasmids were ordered according 
to size and complementation activity, the position of the fepA 
gene boundaries could be discerned. The restriction maps of 
the parent plasmids and the strategy for nucleotide sequence 
determination are shown in Fig. 1. The position and extent 
of the fepA coding sequence is represented by the large arrow 
at the bottom of the figure. Initially, sequencing was per- 
formed on the Hindlll to EcoRl DNA fragments from the 
deletion plasmids subcloned into phage M13mp8 using the 
dideoxy chain termination method. These data served to 
locate restriction sites which could provide overlapping DNA 
fragments carrying the opposite strand. The particular M13 
vector used was dependent on the restriction fragment to be 
sequenced. All of the fepA coding sequence was determined 
from both strands and readings extending across all restric- 
tion sites were obtained. 

The fepA Gene— Fig. 2 shows the nucleotide sequence of 
fepA, beginning at the Hinfl site 892 base ^-airs to the left 



(Fig. 1) of the insert's unique EcoRl site and extending to the 
Stul site 1,728 base pairs to the right of the EcoRl site. The 
deduced amino acid sequence of the only open reading frame 
long enough to code for a polypeptide of about 81,000 molec- 
ular weight is shown above the nucleotide sequence. The 
direction of transcription and hence of translation is counter- 
clockwise (Le. from lip toward purE) in agreement with the 
results of Fleming et al (21). Possible translation initiation 
codons are found at positions 204, 226, and 271. The third 
ATG is the most likely translation start site for two reasons: 
(i) the succeeding 21 amino acids were typical of a signal 
sequence present on other outer membrane proteins with the 
probable cleavage site after the alanine-glutamine -alanine 

(22) ; and (ii) the sequence preceding this ATG had greater 
complementarity to the 3' end of 16 S ribosomal RNA (Shine- 
Dalgarno sequence) than did the corresponding regions of the 
other two. 

Upstream from the translation initiator site are a number 
of potential promotor -35 and -10 regions. The sequence 
TGACTGCGT, starting at position 160, also occurs upstream 
of fhuA and btuB (see Miniprint Section). 2 Although a typical 
-10 region does not follow 16-19 base pairs downstream, this 
sequence may be the -35 region for these genes or it may be a 
recognition site for a common regulatory effector molecule. 
btuB expression has been observed to be regulated by iron, 3 
as are fepA and fhuA (3, 23). Therefore this sequence could 
be the binding site for an iron repressor protein such as fur 

(23) . A less conserved sequence occurs upstream of iutA but 
it is uncertain whether this gene would have the same regu- 
latory controls since iutA is part of the aerobactin operon and 
its promotor is far upstream of the iutA gene (24). 

The UGA termination codon is followed by a region in 
which either of two stable stem and loop RNA structures can 
form. These inverted repeats are indicated by arrows in Fig, 



2 Portions of this paper (including Tables 1-3 and an additional 
Fig. 1) are presented in miniprint at the end of this paper. Miniprint 
is easily read with the aid of a standard magnifying glass. Full size 
photocopies are available from the Journal of Biological Chemistry, 
9650 Rockville Pike, Bethesda, MD 20814. Request Document No. 
86M-0806, cite the authors, and include a check or money order for 
$1.20 per set of photocopies. Full size photocopies are also included 
in the microfilm edition of the Journal that is available from Waverly 
Press. 

3 K. Heller, personal communication. 
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CACTCCACACCTCTCACTCCTCATTTA^rc<XCTCACACCATA^^ 

* 100 . 

TAATGCCCCCCCTCCAACCCCGCACATTAArTAACCAACTCACTGCCTGT^ 
"TTXXTGCCAAAAATCCACGAATAAAACAATG^ 

S R D D T I V V TAAEONLQAPGVSTI TADE I RKNFVAftnve* i 
TOttTCAaWTACTArWTCGTTACa^CCGACCACAA 

**T"frCVHLTCtlSTSGQRGWKRQIDIRCIlGPEMTLILlr>r 
ATCCGTACCATCCCAGGCCTTAACCTCACCGGTAACTCCACCAGTWrrCAC 

* * 600 

*JV vfisBBSV *0<i''BCERDTBGDT : 6»rVPPEM I ED lEVLRCP 
AACCCCWTAAGCAGCCCTAACTCCGTCCCTCACCWra^ • 

* • 700 

* R A , R T GSGAAGCVVWIITEKGSCEIfBGSlfDAYrNAPEHKE 
CCACGTGCGOGTT ATCGCAACGGCGCGCCCGCCGC CGTGGTTAACATCATT ACCAAAAAAGCCAGOGGCG AGTGG CA CGG CTCCTGGG ACGCATATTTCAATCCGCCM AACATAAAGAG 
* ' 800 * • 

EcoRI 

E C * T I » T V P SLTGPLGOEFSFSLYGNLDKT0ADAWDI HOC 

GAAGG TG CCA CCAAACGCACT AACTTT AGCCTC ACOGGTCCGCTGGGCGACG AATTCACCTTCCGTTTG TATGGCAACCTCG ACAAAA CCCACGCTG ACGCGTGGGAT ATCUCCAGGGC 



Fig. 2. Nucleotide sequence of the 
fepA gene and the deduced amino 
acid sequence of the FepA precur- 
sor. The antisense strand is shown with 
the amino acid sequence, in one-letter 
code, above it. Representative restriction 
sites are indicated by the names above 
their respective recognition sequence. 
The vertical arrow is the putative signal 
sequence cleavage site. Horizontal arrows 
designate a region of dyad symmetry fol- 
lowing the termination codon. 



BOSARAGTYATTLPACREGVIHIDIMCVVHHDPAPLOSLV 
CATCAGTCtXCCCC^CCMAACCTATGCCACCACGTTACCAGCCCGGM 

LEAGtSROCNLYAGDTQHTNSDGYTRSKYGDETUBl V D n u 
CTGG AAGCAGCTT ACAG CCGCCAGGGTAACCTGT ATGCGGCCC ACACCCAG AATACCAA CTCOT ATTCCTAT ACCCCCTCG AAAT ATGGCG ATGAAACCAACCCTCTCT ATCGCCAGAAC 
UC0 .... 1!00 

KpnJ 

YALTVriCGNDNCVTTSNWVQYEHTftNSRIFEGLACGTECK 

tacgccctgacctggaac«:tcgctgggataac«:cctiuccaccaccaactgk 

* . 1300 

TTTAAC(»AA^CC^CACACGATTTO1^TATCGA 



K N 0 0 R H X 



QALTCTNTGCAIDCVSTTDBSPTS 



K A E I 



FAEHNHELTDS 



C L R F D 



BSIVGNNN'6 



A L N I S 



0 G L G 



D F T L K ft G 



ARAYIAPSLYOTNPKYILYSKGQG.C 



CAACGTTtAGCCC^TCACTTCACGCTGAAAATGGCCATCCCCtXITCCTtATAAAGCGC^ 



SAGGCYLQGM 



O L K A E 



SINKEIGLEFK 



ACCCCGGGOKCTCCTATCTGCAAGGTAACGATCACCTG AAAGCACAAACCAG CATCAA CAAAG AGATTGGTCTCGAGTTCAAACC 



NDYSKK I EAGYVAVCQ 



AVGTDLYQWDNVPKAV 



B G L E C S 



S E T V ft 



1 T Y K L K S 



G O R L 6 I 



P E Y T L N S 



0 A * E D L 5 



0 T T T r 



YGIOOPICKYMYICQPAVCPETI 
— ' JCTTGGACCOGAAACCAAA 



EISPY6IVGLSAT 



N V S L T C C 



DNLFDIRLWRACNA 



GAAATrACTCCTTACACCATTCTTO^CTCAGCCCGACCTGCGATCYCACGAAGAATC 

2300 ' • • . 2400 



OTTGDLAGANYI AGACAYTY 



E P G F T 



U T B F ' 



CACACCACCWCCATTTGGCACGGCCCAACTATATCCCCGCtCCCG^^ 

' * • 1500 • 



■ ■ Stid 

ATTGTTCACAAACTCCGCCTCCTrCATCCCGCATCCGCQCTGAAIBCOT 

* * 2600 * 



2 and the structures they can form are presented in the 
Miniprint Section. Because neither of these structures are 
followed by a run of Ts, as is the case for rho-independent 
terminators, it is likely that this region is a rho factor- 
dependent transcription terminator for the fepA gene. The 
rho-dependent terminators for At R1 and tRNA-Tyr are also 
deficient in Ts at the 3' ends, whereas terminators in which 
a stem loop structure is followed by 3 to 7 T residues can 
function independently of rho (25). In addition, following the 
stem loop structures are the sequences CAAAAA and 
CAATTCAA which are similar to the sequences CAAAAG 
and CAAUCAA found at or near the 3' end of several rho- 
dependent transcripts (26). 

Features of the FepA polypeptide— The predicted FepA pro- 
tein consists of 723 amino acids preceded by a 22-residue 



leader or signal peptide. The molecular weight of the mature 
protein was calculated to be 79,908 in agreement with the 
value determined by sodium dodecyl sulfate-polyacrylamide 
gel electrophoresis. Similar to other outer membrane proteins 
the number of charged residues in FepA were 21% of the 
total. The mean hydropathy calculated according to Kyte and 
Doolittle (27) was -0.64; by comparison that of BtuB was 
-0.55. Hydrophobic amino acids were fairly evenly dispersed 
throughout the polypeptide as were charged residues; no un- 
charged regions of sufficient length to span the outer mem- 
brane as an a-helix were present. With respect to net charge, 
FepA (net charge of -14) was like the porins (OmpF, -11; 
OmpC, -14; and PhoE, -9) (28) and the TonB -dependent 
receptors (BtuB, -14; FhuA, -16; and IutA, -13). EspA has 
2 cysteines, FhuA has 4, but the porins and the other two 
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TonB-dependent receptors have none. Like BtuB the distri- 
bution of tyrosine residues in FepA was not even. The amino- 
terminal end of FepA was devoid of tyrosines to residue 133, 
whereas clusters of tyrosines occurred in the carboxyl-termi- 
nal half of the protein. 

Codon usage (see Miniprint Section) was typical of that of 
a strongly expressed protein since codons recognized by the 
most abundant tRNA species were preferred (29, 30). This 
was in agreement with the observation that the derepressed 
level of FepA in the outer membrane can rival the porin 
content (3). 

Homology among TonB-dependent Receptors — Fig. 3 pre- 
sents the sequences in the four regions of significant amino 
acid homology shared by the TonB-dependent receptors, 
FepA, BtuB, FhuA, and IutA. Region I includes the amino- 
termini of the four proteins. In cells carrying the btuB451 
mutation, vitamin B a2 uptake activity is abolished while other 
receptor functions are normal, possibly owing to failure of the 
altered receptor to interact with TonB. The mutation in the 
btuB451 allele changed the eighth amino acid of the mature 
BtuB from leucine to proline (12). Fig. 3 shows that this 
leucine residue is near the center of homology region I. The 
finding that the amino- termini of these four TonB-dependent 
receptors are homologous supports the proposal that this 
region is important for proper interaction with TonB or 
energy transduction. 

Homology region II is near the carboxyl terminus and is 
the most highly conserved of the four homology regions but 
is also the shortest. 

Region III is the least conserved of these sequences; how- 
ever, like region IV, the homology extends for a greater length 
than does that of regions I or II. In positions where amino 
acid residues are not identical, conservative replacements 
were frequently observed. 



Comparison of the nucleotide sequences encoding the ho- 
mologous regions (not shown) revealed that the sequences 
were disparate, suggesting that these proteins did not arise 
from a common ancestor but rather resulted from convergent 
evolution. Functions of the homologous regions could include 
interaction with TonB, export and localization of the proteins 
to the outer membrane, interaction with lipids or lipopolysac- 
caride, or involvement in the ligand transport mechanism, 
possibly by channel formation. No substantial regions of. 
homology among the four TonB-dependent receptors and the 
outer membrane proteins OmpC, PhoE, and LamB were found 
by the homology search program, FASTP, of Lipman and 
Pearson (32). Very short regions of homology between FepA 
and OmpF, and FepA and OmpA were found. Homology with 
OmpF was found slightly downstream from region IV. The 
region of FepA homologous to a portion of OmpA is about 70 
amino acids upstream of region II. In OmpA this area of 
homology is slightly upstream of the region identified by 
Nikaido and Wu (33). Other homology regions described by 
Nikaido and Wu for the porins, LamB, and OmpA were not 
apparent in FepA. Thus these receptors can be considered as 
a distinct class of outer membrane proteins in which amino 
acid sequence homology occurs only in a few segments. Inves- 
tigations of the function of these conserved regions should 
prove interesting. 
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Nucleotide Sequence of the Ferri-enterochel in Receptor FepA: Homology Among 
Escherichia col t Outer Membrane Becepton Which Interact with TonB. 
by 

Michael D. Lundrigan and Robert J. Kadner 

TABLE 1. A common sequence in the upstream region of f epA . btnB . fhnA . and 
jLOlA. The number above the last T it the distance from the translation start 
codon in nucleotides. 

-102 

f epA TGACTGCGT 
-161 

btnB TGATTGCGT 



fhoA AGATTGCGT 



-144 

lot A TAATOGOGT 



TABLE 2. Amino acid composition of the FepA polypeptide. Parentheses 
indicate the amino acid compositions of the signal sequence. 

Amino Acid Number of Residues 



Ala 
Ar« 
Asn 
Asp 
Cy. 
Gin 
Glu 
Gly 
Bis 
He 
Uu 
Lys 
Met 
Pbe 
Pro 
Ser 
Thr 
Trp 
Tyr 
Val 



50 (3) 
39 

56 (2) 
45 
2 

31 (1) 
39 

74 (2) 

10 (1) 
35 (2) 
50 (4) 
31 (2) 

11 (1) 
17 

29 

48 (1) 

65 

20 

31 (1) 
40 <2) 



Fig. 1. Alternative stem-and-loop structures in the terminator region of 
t epA . A free energy of -11.4 keel and -20.4 heal was calculated for 
structures I and II respectively by the method of Tinoco et al. (36). 
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G-D 

U 

D-A 
A-D 
G°C 
G-C 
O-G 
0=C 
G-C 

GUDCAD C0ACAAAAA0CUUG 



Number of Residues 723 

Molecular Weight 79,908 

% Charged Residues 21 

Net Charge -14 

Mean Hydropathy -0.64 



TABLE 3. Codon usage for the f cnA gene, 
for the signal sequence. 



Parentheses indicate the codons used 



PUD 


3 




DCD 


0 




DAD 22 


<1> 


DSD 0 


DUC 


14 




DCC 


7 


(1) 


DAC 9 




CGC 2 


UUA 


5 




UCA 


3 




DAA 0 




DGA 1 


DUG 


8 


(2) 


DOS 


11 




DAfl 0 




OSG 20 


CUD 


1 




CCD 


4 




CAD 4 


(1) 


COD 23 


cue 


4 




CCC 


0 




CAC 6 




CGC 11 


CDA 


0 




CCA 


6 




CAA 7 




OSA .2 


COG 


32 


(2) 


COS 


19 




CAG 24 


(1) 


COG 3 


AOU 


19 


(2) 


A CD 


7 




AAD 12 


(1) 


AGO 8 


ADC 


15 




ACC 


38 




AAC 44 


(1) 


AGC 19 


ADA 


1 




ACA 


1 




AAA 22 




AG A 0 


AUG 


11 


(1) 


AOS 


19 




AAG 9 


(2) 


AGG 0 


GCD 


13 




GCD 


3 




GAD 28 




GGU 24 


GUC 


6 


(1) 


GCC 


14 


(1) 


GAC 17 




GGC 37 


GUA 


5 


U) 


GCA 


10 




GAA 28 




GGA 3 


GDG 


16 




GOG 


23 


(2) 


GAG 11 




GGG 8 (2) 



